Title: A Bayesian Approach to Discovering Truth from Conflicting Sources for Data integration Conference: VLDB 2012

نویسنده

  • Xiaofeng Xu
چکیده

Truth discovering is an interesting problem in data integration. In practical data integration system, it is common for the data sources being integrated to provide conflicting information about the same entity, thus raises the truth finding problem. The authors propose a Bayesian approach, the latent truth model, to solve the problem. The authors also conduct experiments regarding to both effectiveness and efficiency.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration

In practical data integration systems, it is common for the data sources being integrated to provide conflicting information about the same entity. Consequently, a major challenge for data integration is to derive the most complete and accurate integrated records from diverse and sometimes conflicting sources. We term this challenge the truth finding problem. We observe that some sources are ge...

متن کامل

A Probabilistic Model for Estimating Real-valued Truth from Conflicting Sources

One important task in data integration is to identify truth from noisy and conflicting data records collected from multiple sources, i.e., the truth finding problem. Previously, several methods have been proposed to solve this problem by simultaneously learning the quality of sources and the truth. However, all those methods are mainly designed for handling categorical data but not numerical da...

متن کامل

Integrating Conflicting Data: The Role of Source Dependence

Many data management applications, such as setting up Web portals, managing enterprise data, managing community data, and sharing scientific data, require integrating data from multiple sources. Each of these sources provides a set of values and different sources can often provide conflicting values. To present quality data to users, it is critical that data integration systems can resolve conf...

متن کامل

Information Integration: The MOMIS Project Demonstration

1 Overview The goal of this demonstration is to present the main features of a Mediator component, Global Schema Builder, of an I3 system, called MOMIS (Mediator envirOnment for Multiple Information Sources) 1]. MOMIS 12 has been conceived to provide an integrated access to heterogeneous information stored in traditional databases (e.g., relational, object-oriented) or le systems, as well as in...

متن کامل

Data Warehouse Configuration

In the data warehousing approach to the integration of data from multiple information sources, selected information is extracted in advance and stored in a repository. A data warehouse (DW) can therefore be seen as a set of materialized views defined over the sources. When a query is posed, it is evaluated locally, using the materialized views, without accessing the original information sources...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013